Corpus

Column

Introduction

The corpus that will be used for this project is the top 50 of multiple countries on Spotify that has an available top 50. This is an interesting corpus because it can show the differences and similarities between popular music in many different places across the world. The comparison points can be either countries, regions, or continents. This flexibility is another advantage of the corpus. The limitations of this corpus are the fact that recent releases or events can influence music worldwide and can push music to the top of many top 50 lists while not saying much about the differences between places (such as the recent release of Kanye West having multiple songs in multiple different top 50 lists). Another limitation is the fact that multiple countries might not use Spotify as their main source of music, so this could influence regional trends in a way that makes them not representative of the population of the country as a whole. A final weakness is the fact that this corpus changes daily, so it can be hard to look into individual tracks since they will probably be gone after a certain time. Finally, it is difficult to pick specific interesting tracks in this corpus due to the sheer number of tracks (3600) in it. But some that may be interesting at first glance are CARNIVAL (Kanye West), Cruel Summer (Taylor Swift), Unwritten (Natasha Bedingfield) and I Wanna Be Yours (Arctic Monkeys). This Kanye West song and many others of his (very) recent album are interesting because they can be found in nearly every top 50 list. The other three songs are mostly interesting because they are older songs that are still found in multiple top 50 lists across the corpus.

Outlier analysis

Row

Release date of the songs

Plot A

This plot shows PLACEHOLDER

Plot B

This plot shows the relation between energy and valence in music from all the different top 50s, it also shows which top 50 every datapoint is from using colors typically associated with the countries that are used. It shows that energy and valence are correlated, but not as much as you might expect. It also shows some regional trends.

Row

Plot A

Plot B

Regional differences

Description

Some text discussing all the different plots.

Column

Acousticness

Danceability

Energy

Liveness

Loudness

Speechiness

Tempo

Valence

Individual songs

Row

Description

This is a chroma graph of an outlier in the dataset: “Cruel Summer” by Taylor Swift, which is one of the oldest songs which is found in multiple top 50 lists.

Column

Chroma analysis of an outlier

Row

Structure similarity matrix of an outlier

Description

These are two self similarity matrices of “Cruel Summer” by Taylor Swift, these similarity matrices show both Chroma and Timbre.

Row

Description

Chordogram